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Human decisional processes result from the employment of selected 
quantities of relevant information, generally synthesized from environmen- 
tal incoming data and stored memories. Their main goal is the production 
of an appropriate and adaptive response to a cognitive or behavioral task. 
Different strategies of response production can be adopted, among which 
haphazard trials, formation of mental schemes and heuristics. In this pa- 
per, we propose a model of Boolean neural network that incorporates these 
strategies by recurring to global optimization strategies during the learn- 
ing session. The model characterizes as well the passage from an unstruc- 
tured/chaotic attractor neural network typical of data-driven processes to 
a faster one, forward-only and representative of schema-driven processes. 
Moreover, a simplified version of the Iowa Gambling Task (IGT) is intro- 
duced in order to test the model. Our results match with experimental data 
and point out some relevant knowledge coming from psychological domain. 
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1. Introduction 

Humans usually categorize incoming information into stable concepts 
which can be upgraded, related and nested one into another. The charac- 
teristics of information are analyzed and classified into (i.e. they activate) 
existing concepts but, whenever they would represent a novelty, they will 
induce the formation of a new concept or the upgrade of the existing ones. 
This adaptive modality of knowledge organization makes cognitive system 
able to classify, store and employ at best incoming information, in order to 
solve the eventual cognitive demand during next steps of processing. Sub- 
sequently, human cognitive or behavioral responses to a given set of inputs 
are built following several and different decisional strategies. The role of 
context, the kind of information and past experiences are central for the 
choice of what kind of decision making will be made. In general, we can 
take into consideration at least five strategies of output production: 

1) Reflexive responses: direct associations of inputs with an out- 
put pattern. They require no attentional resources and are out of possible 
controls. Typical examples are reflexive genetically implemented motor re- 
sponses (e.g. the evade reflex) and associative behaviors (e.g. the Pavlov's 
dog salivation reflex). 

2) Automatic processes: standardized quick responses associated to 
a frequent activation of simple concepts while the behavioral relevance of 
the input/event is under an "alert red-line" (i.e. it requires a behavioral 
response but not a direct attentional monitoring, for instance during the 
vehicle driving). 

3) Routine processes: processes triggered by several related concepts 
or complex events sufficiently frequent to constitute a stereotypical routine. 
Routines can be solved by a script [U |2] and need the emergence of mental 
schemes [3 [3l Hj, namely representations of complex concepts or events 
easily connectable to a fast cognitive or behavioral response. Note that 
even if the strategy requires an attentive control, it doesn't involve the 
same set of cognitive ability needed during a problem solving task, like the 
resolution of a syllogism or the Wason selection task ^ . 

4) Reasoning: higher cognitive strategy of understanding and produc- 
tion, mainly used during the problem solving. Given a set of premises, 
humans seem to employ rules like those involved in formal logic [6], which 
establish the correct formal solution. The prepositional reasoning makes no 
distinctions about the contents of a statement, but deals only with its syn- 
tactical structure. Unfortunately, human judgements are sometimes very 
far from correct formal solutions. For these reasons, a theory of mental 
models [3 [5] has been proposed, which claims that hypothetical-deductive 
reasoning have three stages of thought: an understanding of the premises 
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which leads up to model construction, a formulation of provisional conclu- 
sions, a revision procedure that verifies if other models are possible. Errors 
occur because of working memory limitedness: the bigger is the number of 
models that we have to menage, the harder the problem becomes. So, errors 
are conclusions not rigorously verified. 

5) Heuristic behavior: modus operandi typical of situations in which 
there is either a lack of information, or different mental schemes run in 
conflict, or there is no time to reasoning, or the task is too difficult. In 
these cases, individuals adopt strategies more similar to an attempt rather 
than to the formal solution. Note that, in some cases, the use of heuristics is 
mandatory and constitutes a cognitive bia^. Most important heuristics in 
psychology are anchoring/adjustment, availability, and representativeness 

m- 

While strategies 1-4 belong to a hierarchy of use and exploitation of 
cognitive resources, heuristics take place only after strategy 3, and if there 
are no conditions to apply the other ones or their application fails. Finally, it 
is possible to bring back the aforementioned considerations into the simple 
cognitive model reported in Fig. 1. Neglecting earlier input systems and 
concept stages, our main purpose is to formalize into a neural model the 
cognitive constructs of mental scheme and heuristic, showing how these can 
modify the production of a response. Besides we will propose a simplified 
version of the IGT |1H IT^ in order to test our model. 

The paper is organized as follow. In the next section we introduce the 
neural model, fourth section is devoted to model fitting, and in the final 
sections we show the main results of our simulations and discuss about 
future perspectives. 

2. The Model 

The model assumes that cognitive activities during a task resolution can 
be represented as a Boolean neural network, whose nodes do not necessar- 
ily correspond to single biological neurons but rather to organized sets of 
neurons, named functional areas. We choose this formalization in order to 
remain within the framework of neural domain. 

The basic computational entities, namely the formal neurons, are de- 
scribed by the following parameters: 

1. a: the internal and external state of activation € {-1, -|-1} 

2. b: the threshold € [0, 1] 

^ While the abstract version of the Wason selection task leads to a correct performance 
of 3.9%, the concrete one leads to a correct performance of 91%. These results are 
interpreted recurring to the availability heuristic applied on past experiences. 
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Fig. 1. The cognitive model. Flowchart symbols as defined in [13] 



3. c: the connectivity degree, i.e. the number of afferent synapses of the 
neuron 

The N bipolar neurons are linked by connections, named synapses, each 
bearing a weight € {-1, 0, At time t, the i-th node computes incoming 

signals from afferent neurons and, at time t + 1, produce a signal, i.e. fires, 
according to the following update law: 

= «gn I^X; - b)j (1) 

where sgn(3;) returns the sign of real number x, Wij is the incoming 
weighted synapse of i-th neuron from the j-th one, and Cj = This 
formalization makes the adopted formal neuron similar to that of McCulloch 
& Pitts [2]. 

Dynamics: search of an asymptotic configuration. By gener- 
ating an arbitrary a and 6, and a connection structure W with entries 
Wij uniformly distributed in {-1, 0, +1}, we are able to define the start- 



^ The case 0-weight corresponds to the absence of hnk. We don't allow auto-synapses, 
or self-recurring links. 
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ing configuration C*" , composed by {W, b, jf], i.e. the initial condition of 
the dynamics. Neurons are synchronously updated by applying iteratively 
Eq. ([1]) for a sufficiently large time tmeas, the maximum convergence time 
allowecQ. If W is asymmetric and according to its asymmetry degree, a 
periodical orbit of length / is reached after a transient r, giving the asymp- 
totic configuration (*™eas^ composed by (1^,6,(7*'"'='"'). In the following, all 
the procedures will make use of this concept of asymptotic configuration 
and its eventual distance from the correct behavior. This is motivated from 
the assumption that only stable asymptotical configurations can account for 
stability and invariance of response typical of human cognitive processes. In 
principle, tc can be viewed as the time needed by the cognitive elaboration. 

Training phase: a problem of global optimization. We choose a 
subset of n Boolean functions as possible instances of the training problem. 
For each 7-th function we select A'^^ input neurons from which the remaining 
ones take entries for the update dynamics and just one, that is Nq = 1, 
output neuron from which, after the attainment of a ^*™'!<i« ^ we will measure 
the computed output. 

Moreover, for each 7-th function we generate the corresponding training 
set e"' , for which all possible examples are given by coupling one of the 
inputs ^^'^"^ with the respective wanted output for /i = 1,...,P^. 

The presentation of the entire training set e is consequently just one training 
epoclJl. At the end of each epoch, the error signal E{(^^"^'"''') will be: 



We have just introduced the vector notation for the matrix of synaptic weights W = 
{wij)ij, the state vector <?* = {a\, . . . , (t^), the threshold vector b = (&i , . . . , biv) and 
the connectivity vector c = (ci, . . . , C]v). 
* In principle the longest convergence time tc should be 2^, the maximal periodical 
orbit of a finite size discrete system of A*' unities having 2 possible values. But for 
reasons of feasibility of simulations, fixing a reasonable tmax, tmeaa will correspond 
to min{tc,tmax}- 

n 

^ Consequently, given n functions to learn, the input vector / will have Ni = N] 

7=1 

rt 

elements, the output vector O will have No = Nq = n elements, and the size of 

7=1 

n 

e will be P = P"*", where = . Notably, the input vector / cannot change 

7=1 

entries during dynamical update T becomes 2^~^^ . We notice that, since we 
do not need to test our network on the complementary subset of e^, we can freely 
present, for each 7-th instance of the problem, the entire correspondent training set 
e'' during the training phase. 
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E = { APl^ ^ ' ''^^ "^'^ ^ ' 10 '""-^ (2) 

1 ^ i=N-No+llJ.=l j=l ^ ' 

10 • No otherwise 



where Sf^ and ^f^* are respectively the computed and the wanted output 
for the ^-th pattern of e|f|. The addiction of a term for transient wih justify 
the passage from a computationally slow attractor neural network, proto- 
typical of data-driven processes, to a faster and oriented one representative 
of schema-driven processes. Each asymptotic configuration having E = 0, 
is one of the possible configurations able to solve the n incoming instances 
of the problem and therefore the presented task. 

As there aren't clues as the way (^*™'!''^ have to be modified, we choose to 
proceed by trial and error. Consequently, the learning problem can be asso- 
ciated with a problem of global optimization, where E is the object function 
to be minimized. The choice of the optimization strategy will produce dif- 
ferent behaviors of the network during the learning phase and consequently, 
we will be able to associate them to different cognitive strategies of task 
resolution. 

Haphazard trials. No optimization strategy is applied. Starting from 
an arbitrary generated condition, a series of local perturbations are pro- 
duced, by modifying just one entry either in or 6 selected at random 
with uniformly distributed probability. For each perturbation, the corre- 
sponding El^^"^""-") is calculated. The resulting trend is a random walk of 
E. This non strategic behavior is directly affected by the involved func- 
tional areas, namely by the size of the solution space. Consequently this 
strategy can need a very long time to reach the solution of the problem and 
to produce the correct response to the task. 

Emergence of mental schemes. The optimized asymptotical config- 
uration must be able to store the n presented instances as mental schemes, 
which future activation will produce a fast and cognitively cheap response. 
We choose to formalize the emergence of mental schemes as a simulated an- 
nealing procedure with geometrical cooling ratio cl, fixed once for all at 0.6, 
and by using E as energy. By starting from the arbitrary generated , 
a local perturbation is produced with same modalities of the haphazard tri- 
als. The resulting (" '-"^^"■^ differs in of a value AE from the previous one. 
For the acceptation of the new configuration, we refer to the Metropolis 



We remark that the first term in first case of Eq. (2) is just the normalized average 
Hamming distance generalized for the entire orbital length. 
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algorithm. The new configuration is kept with probability: 

P=i^ (3) 
I exjp{—AE/T) otherwise 

where T will modulate the cooling schedul^. The perturbation procedure 
continues until E((^*'"^'"^'') = for a reasonable number of epochs. At this 
point the system have inferred and stored the n instances. When a future 
behavioral situation will pose the same set of input, the stable reached 
configuration will be reactivated and, having respectively r and E equals to 
zero and I equal to 1, it will produce one of the fastest and correct responses 
admissible by the task. 



3. Results 

Figure 2 shows how the attainment of a configuration having E = 
can depend on the size of the feasible region, in our model function of N. 
During a series of local perturbations all accepted, the greater is A'' the 
more difficult becomes the search of a configuration able to solve the task. 
Anyhow, this dependency is also found by applying the algorithms of global 

a b 



N = 5 N=10 




Time Time 



Fig. 2. Error signal during perturbation phase. Each time step is just one local 
perturbation. The circles marks the occasional configurations with E = 0. a) 
N ^5;h) N = 10. 



optimization during the learning phase. 

Results of the optimization procedure are presented in Fig. 3. Com- 
paring Fig. 3a with Fig. 2b, it is clear what happens to the slow dynamics 
during optimization. Having at the beginning a large temperature T, all 
moves can be accepted in spite of their respective error E, allowing the 



^ The starting To is fixed once for all at 5. Each 10 epochs we sample the acceptance 
probability op. If ap — 0.5 ± 0.2 then T^+i = cl ■ T^. 



passage among basins. By decreasing T, only moves that decrease the error 
E begin to be accepted, see Eq. (4), causing a more exhaustive exploration 
of the small- £^ of the basin up to the reaching of the global minimum. The 
dependence of E from n and shown in Fig. 3b, c can be easily reported to 
the task difficulty, typically correlated with the number of instances of the 
learning problem and the number of involved functional areas. 



a 
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Fig. 3. Mean error signal over 30 learning sessions with Metropolis algorithm. 
Each time step corresponds to the acceptance of just one local perturbation, a) 
Modulation of T on E, fixed = 10, n ^ 1; b) fixed N ^ 10, n variable; c) N 
variable, fixed n = 1. 



Figure 4 shows the passage from an initial unconstrained dynamics to 
an optimized one, ruled by the learning of a scheme. This transition also 
corresponds to a passage from a C*° having one of possible r and / (Fig. 4a), 
to a having r and I respectively equal to zero and one (Fig. 4b). 



a b 
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Fig. 4. Dynamics transition. Neurons activation (dark/black = +1; light/white = 
-1) as function of time. Each time step is just one application of ([1]). 
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4. Testing the model 

In this section we introduce a simphfied version in order to test 

quahtatively the predictions of the model. 

The task consists of trials where a subject must select a card, reporting 
both a term of winning and and a term of loss, from 4 decks (A', B', C and 
D', respectively) of 60 cards. Main goal of the subject is to maximize its 
budget after a 100 trials session. The temporal series of decks are different; 
decks A' and B' promise strong winning in the short period but stronger 
losses in the long period, while C and D', promising small winnings but 
also smaller losses, assure a better budget at the end of the task. 

For our purpose, the task to perform by our network take in considera- 
tion only the two native decks B' (min = -2330; max = 170; mean = -62.5) 
and D' (min = -310; max = 95; mean = -31.25), which length is maintained 
to 60. While the choice of the first card is random, the following ones are 
given by the output of the optimized network, composed by 5 neurons, one 
of which is the input and one the output. At each trial Tr, the network 
computes all the Tr — 1 previous trials as examples into a simulated anneal- 
ing optimization step, using the previous computed choices as computed 
outputs and the terms of winning or loss in the corresponding tempo- 
ral series as wanted outputs This procedure assumes in psychological 
terms an infinite working memory, and explicitly makes use of the heuristic 
of availability; in absence of relevant information about the covered cards, 
humans tend to use available information stored from past trials. Once opti- 
mized, the output computed by the asymptotical configuration will become 
the choice of the new trial, and its value is registered for future choices. 

Figure 5 shows typical behavior of the network while performing the 
simplified IGT. The winnings early promised by B' prematurely infiuences 
the network response in favor of deck B' but, after the first severe losses, the 
functional E associated to B' becomes too large and the computed output 
switch in favor of D'. From this moment on, almost all the perturbations to 
the asymptotical configuration (^^^^'^'> are rejected by the simulated anneal- 
ing and the network quickly produce its choice trial by trial. It is interesting 
to point out that the transition from a regime distinguished by choices in 
B' to one distinguished by choice in D' happens approximately at the same 
time of humans \12\ [T3] , implying that the time scales in which the mental 
model becomes effective are comparable between humans and our model. 

Moreover after the first great loss given by B', network tends to persevere 
with choices in the same deck, as both normal subjects and pathological 



The original IGT is a psychological task used into larger test batteries to study 
qualitatively behavior of normal subjects during simulated real-life decision making. 
In neuropsychological practice, it is administered to pathological gamblers. 



10 




Fig. 5. Behavior of the network over 30 runs. Each time step is just one trial. Red 
starred dots are most frequent choice of B', green for D'. 



gamblers. This strange behavior can be interpreted from a psychological 
point of view as a persistence of the use of the heuristics of anchoring and 
adjustment during earlier trials after the loss, while in our network is due 
to the fact that the error quotas related to B' and D' becomes comparable. 



5. Conclusion and future perspectives 

We have presented preliminary results of application of a Boolean model 
of neural network to relevant cognitive strategies involved in decision making 
tasks. At moment only mental schemes have been stTidied. The choice of a 
such formalization is due to the possibilities that Boolean neural networks 
offers in terms of robustness, ease of simulation and easy generation of 
samples for data fitting. 

The model appears to capture the most relevant psychological knowl- 
edge regarding the domain of application. By shifting from an unstructured 
slow attractor neural network to a quicker forward-only one, it hold in re- 
spect of learning studies about task complexity and the number of employed 
cognitive resources. As defined, mental schemes become fast and adaptive 
cognitive strategies of behavioral response. 

Regarding the model fitting, the behavior of our network on the simpli- 
fied version of the IGT produces results qualitatively comparable with those 
of humans. 

Future studies will be targeted to include into the model aspects regard- 
ing probabilistic and hypothetical-deductive reasoning, while future appli- 
cations will take in consideration pathological gambling. 
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Appendix A 

Implementation algorithm 

The model above described has been implemented with Matlab R2007a. For 
sakes of space, the codes and the temporal series employed for the model 
fitting are available by directly contacting the authors. 
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